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(54) Microcomputer with packet bus 

(57) There is disclosed a computer system including 
a microprocessor on a single integrated circuit chip com- 
prising an on-chip CPU and a communication bus pro- 
viding a parallel communication path between the CPU 
and at least one of the module with logic circuitry. The 
integrated circuit device further comprises an external 
communication port connected to the bus, having an in- 



ternal parallel format for connection to the bus. The ex- 
ternal port further has an external signal having an ex- 
ternal format less parallel than the internal format. 
Translation circuitry is provided to effect conversion be- 
tween said internal and external formats. There is also 
disclosed a method of operating such a computer sys- 
tem. 
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Description 

The invention relates to microcomputers 

Single chip microcomputers are known including external communication ports so that the chip may be connected 
5 in a network including tor example connection to a host microcomputer for use in debugging routines. Such systems 
are known in which each of the interconnected microcomputer chips has its own local memory. For speed of commu- 
nication on on-chips it is common tor bit packets to be transmitted between modules on a chip in a bit parallel forma:. 
However problems arise in both power consumption and available pin space in providing for external off-chip commu- 
nications in the same parallel bit format as that used on-chip. Such microcomputers require access to instruction or 
10 code sequences and for efficient operation it is desirable for the instructions to be retrievable from locations within the 
address space of the CPU- 
it is an object of the present invention to provide an improved microcomputer and an improved method of operating 
a microcomputer system, in which external communications are simplified and instructions can be obtained from lo- 
cations off -chip. 

75 The invention provides a computer system comprising a microprocessor on a single integrated circuit chip con- 

nected to an external computer device said integrated circuit chip having an on-chip CPU with a plurality of registers 
and a communication bus providing a parallel communication path between said CPU and a first memory local to the 
CPU ; said integrated circuit furth er comprising an external communication port connected to said bus on the integrated 
circuit chip, said external computer device being connected to said external communication and having a second 

20 memory local to said external computer device, said port having an internal connection to the bus of an interna! parallel 
signal format and an external connection of an external format less parallel than said internal format, said second 
memory being accessible by said CPU through said port, said port forming part of the memory address space of said 
CPU from which instructions may be fetched, whereby said port may be addressed by execution of an instruction by 
said CPU. 

2S Preferably translation circuitry is arranged to translate bit packets between an on-chip bit parallel format and an 

external bit serial format. 

!\ Preferably said on-chip CPU includes circuitry for generating bit packets including a destination identifier within 
/ ! each packet, said external communication port having translation circuitry to translate bit packets between said internal 
/ and external formats while retaining identification of said destination. 
30* Preferably said first and second memories each have addressable locations with addresses within the address 

space of said on-chip CPU and said translation circuitry is arranged to generate packets of said external format including 
an address within said second memory. 

Said first memory may have software for execution by said on-chip CPU and said second memory may have 
software for execution by said on-chip CPU in a debugging routine for said on-chip CPU. 
35 Said second memory may have software for execution by said external computer device in a debugging routine 

for said on-chip CPU. 

Preferably said single integrated circuit chip has a plurality of CPUs on the same chip each connected to said 
communication bus whereby each CPU on said chip may address said external port. 

Preferably said on-chip CPU includes pointer circuitry for identifying the location of a next instruction for execution 
40 by the CPU and said pointer circuitry is operable to point to an address in said second memory. 

The invention includes a method of operating a computer system comprising a microprocessor on a single inte- 
grated circuit chip connected to an external computer device, said integrated circuit chip having an on-chip CPU with 
a plurality of registers and a communication bus providing a parallel communication path between said CPU and a first 
memory local to the CPU ; said integrated circuit further comprising an external communication port connected to said 
45 bus on the integrated circuit chip, said external computer device being connected to said external communication and 
having a second memory local to said external computer device, which method comprises transmitting bit packets on 
said ous with an internal parallel signal formal translating said packets in said port to an external connection of an 
external formal less parallel than said internal format, accessing said second memory by said CPU through said port, 
said port forming part of the memory address space of said CPU, and addressing said port by execution of an instruction 
50 by said CPU and thereby fetching an instruction from said second memory through said port. 

Preferably bit packets are generated with a destination identifier within each packet said external communication 
port translating bit packets between said internal and external formats while retaining identification of said destination 

Preferably said translation of bit packets is between an on-chip bit parallel format and an external bit serial format. 

In one arrangement said first memory has software executed by said on-chip CPU and said second memory has 
55 software executed by said on-chip CPU in a debugging routine for said on-chip CPU. 

Alternatively or additionally said second memory has software executed by said external computer device in a 
debugging routine for said on-chip CPU. 

Preferably said on-chip CPU includes pointer circuitry for identifying the location of a next instruction lor execution 
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by the CPU and said pointer circuitry is loaded with a pointer value pointing to an address in said second memory. 

An embodiment of the present invention will now be described by way of example with reference to the accompa- 
nying drawings in which: 

Figure 1 is a block diagram of a microcomputer chip in accordance with the presen; invention. 

Figure 2 shows more detail of a debug port of the microcomputer of Figure 1 , 

Figure 3 shows input of a digital signal packet through the port of Figure 2, 

Figure 4 shows the output of a digital signal packet to the port of Figure 2, 

Figure 5 shows accessing of registers in the port of Figure 2 t 

Figure 6 shows the format ol a digital signal request packet which may be used in the microcomputer of Figure 1 , 
Figure 7 shows the forma: of a digital signal response packet which may be used in the microcomputer of Figure 1 : 
Figure 8 shows one example of a serial request packet which may be output or input through the port of Figure 2, 
Figure 9 illustrates further details of one CPU of the microcomputer of Figure 1 including special event logic, 
Figure 10 shows further detail of the special event logic of Figure 9, 

Figure 1 1 shows a microcomputer of the type shown in Figure 1 connected to a host computer for use in debugging 
the CPU by operation of the host : 

Figure 1 2 shows an arrangement similar to Figure 11 in which a second CPU is provided on the same chip and 
operates normally while the other CPU is debugged by the host, 

Figure 13 illustrates one CPU forming part of a microcomputer as shown in Figure 1 when connected to a host 
computer for use in watchpoint debugging, 

Figure 14 shows a microcomputer of the type shown in Figure 1 connected to a host computer in which one CPU 
on the microcomputer is debugged by the other CPU on the same chip, 

Figure 15 shows more detail of part of the logic circuitry of Figure 10 : 

Figure 16 shows more detail of part of the logic circuitry of Figure 15, and 

Figure 17 shows more detail of another part of the logic circuitry of Figure 15. 

The preferred embodiment illustrated in Figure 1 comprises a single integrated circuit chip 11 on which is provided 
two CPU circuits 1 2 and 1 3 as well as a plurality of modules 1 4. The CPU's 1 2 and 1 3 as well as each module 1 4 are 
interconnected by a bus network 15 having bi-directional connections to each module In this example the bus network 
is referred to as a P-link consisting of a parallel data bus 20 as shown in Figure 2 together with a dedicated control 
line 21 provided respectively for each module so as to link the module to a P-link control unit 22. Each module is 
provided with a P-link interlace 23 incorporating a state machine so as to interchange control signals between the 
respective P-link control line 21 and the interface 23 as wel, as transferring data in two opposing directions between 
the data bus 20 and the inierface 23. 

In the example shown in Figure 1 . the various modules 14 include a video display interface 25 having an external 
connection 26, a video decode assist circuitry 27 : an audio output interface 23 having an external connection 29, a 
debug port 30 having an external connection 31. an external memory interface 32 having an external bus connection 
33 leading to an external memory, clock circuitry 34, various peripheral interfaces 35 provided with a plurality of bus 
and serial wire output connections 36, a network interface 37 with an external connection 38 as well as the P-link 
control unit 22. The two CPU units 12 and 13 of this example are generally similar in construction and each includes 
a plurality of instruction execution units 40 : a plurality of registers 41 an instruction cache 42 and a data cache 43. In 
this example each CPU also includes event logic circuitry 44 connected to the execution units 40 
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The CPU's can be operated in conventional manner receiving instructions from the instruction caches 42 on chip 
and effecting data read or write operations with the data cache 43 on chip. Additionally external memory accesses for 
read or write operations may be made through the external memory interface 32 and bus connection 33 An important 
provision in this example is the debug port 30 which is described in more detail in Figures 2 to 5. As shown in Figure 
s 2, this circuitry includes a hard reset controller 45 connected to a hard reset pin 46. The controller 45 is connected to 
all modules on the chip shown in Figure 1 so that when the hard reset signal is asserted on pin 46 all circuitry on the 
chip is reset. 

As will be described below : this port 30 provides an important external communication for use in debugging pro- 
cedures. The on-chip CPU's 12 and 13 may obtain instruction code for execution from an external source communi- 

70 eating through the port 30. Communications on the P-link system 15 are carried out in bit parallel format. Transmissions 
on the data bus 20 of the P-link 15 may be carried out in multiple byte pa ckets, for example 35 bytes for each packet, 
so that one packet is transmitted in five consecutive eight byte transfers along the P-link each transfer being in bit 
parallel format. The port 30 is arranged to reduce the parallelism of packets obtained from the P*link 15 so that they 
are output in bit serial format through the output 31 or alternatively in a much reduced parallel format relative to that 

*5 used on the P-link 1 5 so as to reduce the number of external connection pins needed to implement the external con- 
nection 31 . 

The structure of the port 30 will now be described with reference to Figures 2 to 5. 

In this example the port 30 comprises an outgoing packetising buffer 50 connected to the P-link interface 23 as 
well as an incoming packetising buffer 51 connected to trie interface 23. On the output side, the external connection 

20 31 is in this case formed by an output pin 52 and an input pin 53. The port in this case effects a full transition between 
parallel format from the data bus 20 to bit serial format for the input and output pins 52 and 53. The pins 52 and 53 
are connected as part of an output link engine 55 which also incorporates serialtser 56 and deserialiser 57 connected 
respectively to the outgoing packetising buffer 50 and the incoming packetising buffer 51. Between the buffers 50 and 
51 are connected by bidirectional connections a register bank 53 and a port state machine 59. The function of the port 

2B 30 is to translate bit packets between the internal on-chip parallel format and the external bit serial format. In addition 
it allows packets which are input through pin 53 to access the registers 58 in the port without use of the P-link system 
15. Equally packets on the P-link system 15 can access the registers 58 of the port without using the external pins 52 
or 53. 

The format of the multibit packets used in the microcomputer system are illustrated by way of example in Figures 

30 6.7 and 8. When a packet is to be output from the port 30 from one of the modules 1 4 connected to the P-link 1 5, the 
module transmits the parallel representation of the packet along the data bus 20. The packet may comprise a plurality 
of eight byte transfers as already described. Each mod_uieJ4Hncluding the port 3 Q,, have a similar P-link interface 23 
andjhe opexattan-totake daja^rorrithe bus 20 or to put data ontoJhe Jpus 20 is similar for each . When a module has 
a packet to send to another module, for example to the port 30, it first signals this by asserting a request signal on line 

35 60 to the dedicated link 21 connecting that module to the central control 22. it also outputs an eight bit signal on a 
destination bus 61 to indicate to the control the intended destination of the packet it wishes to transmit. It will be 
understood that the P-link 21 is itself a bus. A module such as the port 30, which is able to receive a packet from the 
bus 20 will assert a signal "grant receive" on line 62 to be supplied on the dedicated path 21 to the central control 22 
regardless of whether a packet is available to be fed to that destination or not. When the central control 22 determines 

40 that a module wishes to send a packet to a destination and independently the destination has indicated by the signal 
on line 22 that it is able to receive a packet from the bus 20, the control 22 arranges for the transfer to take place. The 
control 22 asserts the "grant send" signal 63 via the dedicated line 21 to the appropriate interface 23 causing the 
sending module to put the packet onto the P-link data path 20 via the bus 64 interconnecting the interface 23 with the 
data bus 20. The control 22 then asserts the "send" signal 65 of the receiver which signals to it that it should accept 

45 the transfers currently on the P-link data bus 20. The packet transmission concludes when the sender asserts its "end 
of packet send" line 66 concurrently with the last transfer of packet data on the bus 20. This signal is fed on the dedicated 
path 21 to the central control 22 and the control then asserts the "end of packet received" signal 67 to the receiving 
module which causes it to cease accepting data on the P-link data bus 20 after the current transfer has been received. 
The parallel to senal translation which takes place in tne port 30 has a one to one equivalence between the parallel 

so and serial packets so that all data contained in one packet form is contained in the other. The translation therefore 
involves identifying the type of the packet and copying across fields of the packet in a manner determined by the type. 
When a packet is input to the outgoing packetising buffer 50 from the data bus 20, the packet is held in its entirety as 
the buffer is 35 bytes long in order to hold the longest packet. As shown in Figure 4, buffer 50 is connected tothe port 
state machine 59 and to a shift register 70 by a transfer bus 71 . The shift register 70 is connected to the serialiser 56 

55 The state machine 59 provides input signals 72 to the buffer 50 to copy specific bytes from the P-link packet onto the 
transfer bus 71 under the control of the state machine 59. Firstly the most significant byte of the packet, which holds 
the destination header 73, is placed onto the byte wide transfer bus 71 The state machine 59 compares this value 
wrth those values which indicate that the packet is destined for the shift register and output serial link. If the packet is 
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destined to' the output serial link, the state machine causes the next byte 74 of the packet (which is the operation code 
indicating the type of packet) to be placed on the transfer bus 71 . From the opcode 74 which is supplied to the state 
machine 59 on the transfer bus 71 . the state machine determines the length and format of the packet derived from the 
data bus 20 and therefore determines the length and format of the serial packet which it has to synthesise The state 
s machine 59 outputs a byte which indicates the serial length packet onto the transfer bus 71 and this is shifted into the 
first byte position of the shift register 70. The state machine 59 then causes bytes to be copied from the buffer 50 onto 
the bus 71 where they are shifted into the next byte position in the shift register 70. This continues until all the bytes 
from the buffer 50 have been copied across. The order of byte extractions from the buffer 50 is contained in the state 
machine 59 as this determines the reformatting in serial format. The serial packet may then be output by the output 

io engine 55 via pin 52 to externally connected circuitry as will be described with reference to Figures 11 to 14. 

When a serial packet is input through pin 53 to the port 30, the translation is dealt with as follows. Each byte is 
passed into the shift register 80 forming a packelising buffer. Such a serial packet is shown in Figure 8 in which the 
first byte 8 1 indicates the packet size. This will identify the position of the last byte of the packet. Referring to Figure 
3, the register 80 copies bytes in the simple order they are shifted out of the shift register onto a transfer bus 83 under 

is the control of the state machine 59. The state machine 59 compares the destination byte 84 of the packet with those 
values which indicate that the packet is destined for the P-link system 15. The state machine 59 causes the next byte 
85 of the packet to be placed on the transfer bus in order to indicate the type of packet (also known as the opcode) 
and from this the state machine checks the length and format of the serial link packet and those of the P-link packet 
which it has to synthesise The state machine 59 causes bytes to be shifted out of the register 80 onto bus 83 where 

20 they are copied into a P-link packet buffer 51 . This continues until all serial link bytes have been copied across and 
the positions in which the bytes are copied into the buffer 86 from the shift register 80 is determined by setting of the 
state machine 59. This indicates to the interface 23 that a packet is ready to be put on the bus 20 and the interface 
communicates through the dedicated communication path 21 with the central control 22 as previously described. When 
the p.Hnk system 1 5 is ready to accept the packet the interlace responds by copying the first eight bytes of the packet 

25 onto the data path 20 on the following cfock cycle (controlled by clock 34). !t copies consecutive eight byte parts of the 
packet onto the bus 20 on subsequent clock cycles until ail packet bytes have been transmitted. The final eight bytes 
are concurrent with the end of packet send signal being asserted by the interface on line 66 

As already described, an incoming packet (either parallel or serial) to the port 23 may wish to access port registers 
56. When the destination byte 84 of an incoming serial bit packet from the pin 53 indicates that the packet is destined 

30 to access registers 58, the bit serial packet is changed to a P-link packet in buffer 51 as already described but rather 
than being forwarded to the P-link interface 23 : it is used to access the register bank 58. One byte (the opcode) of the 
packet will indicate whether the register access is a read or write access. If the access is a read ; then the state machine 
59 will output a read signal on line 90 shown in Figure 5. Concurrent with this the least significant four bits of the packet 
address field are placed on lines 91. Some cycles later the register bank 58 under control of a control block 92 will 

35 copy the value in the addressed register onto the data bus 93 one byte at a time, each byte on a successive clock 
cycle Each byte on the data line 93 is latched into the outgoing buffer 50 and under control of the state machine 59 ; 
the data read from the register is synthesised into a P-link packet in buffer 50 and specified as a "load response". The 
j destination field for this response packet is copied from a "source" field of a requesting bit serial packet._Air^saciion 
' J identifigrJTID) which is also provided in each packet, is also copied across. A type byte of the response packet is 

40 j formed from the type byte of the request packet and consequently a response P-link packet is formed in the outgoing 
buffer 50 in response to a request packet which was input from an external source to pin 53. 

II the type of access for registers 58 is a write access then the write line 95 is asserted by the state machine 59 
together with the address line 91. Some cycles later the least significant byte of the data is copied from an operand 
field of the packet in buffer 51 onto the data bus 93. On the following seven cycles bytes of successive significance 

45 are copied to the recisters 58 until al! eight bytes have been copied. A response packet is then synthesised in register 
50 except tnat "store response" packets do not have data associated with them and comprise only a destination byte : 
a type byte and a transaction identifier byte. This response packet is translated into a bit serial response packet as 
previously describee, loaded into shift register 70 and output through pin 52 to indicate to the source of the write request 
that a store has been effected. 

50 Similarly if the destination byte of a packet received from the P-link system 15 by the port 30 is examined and 

indicates that the packet is destined to access registers 58 in the port 30, a similar operation is carried out. Rather than 
being forwarded to the bit serial register 70 ; the type of field of the packet is used to determine whether the access is 
a read or write access. If the access is a read then the read line 90 of Figure 5 is asserted by the state machine 59 
and the least significant four bits of the packets address field are placed on the address line 91 Two cycles later the 

55 register bank copies the value held in the register which has been addressed onto the cata line 93 one byte at a time 
each on successive cycles. This is latched into buffer 51 and the state machine synthesises a P-link packet which is 
specified as a "read response" packet The destination field for this response packet is copied from the source field of 
the requesting bit serial packet. The transaction identifier is also copied across. The type byte of the response packet 
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is formed frorr. the type byte of the request packet 

li the type of access required is a write access then state machine 59 asserts the write line 95 together with the 
address line 91 . Some cycles later the least significant byte of the data is copied from the operand field of the packet 
in buffer 50 to the data line 93 On the following seven cycles bytes of successive significance are copied to the data 
lines 93 and copied into the registers until al! bytes have been copied. A response packet is then synthesised as 
previously described except that "store response" packets do not have data associated with them and comprise only 
a destination byte, a type byte and a transaction identifier byte. This response packet is then forwarded to the P-iink 
interface 23 where it is returned to the issuer of the request packet which have been input through the P-link interface 
93 in order to access the port registers 56. 

From the above description it will be understood that the packet formats shown in Figures 6, 7 and 8 include 
packets that form a request ora response to a read or write operation Jn addition to each packet including a destination 
indicator for the packet ( numeral 73 in Figures 6 and 7 or numeral 84 in Figure 8) the packets : in elude a (TjDJJjransaclion 
ideTntneTSEand an indicaJTonof the sou^ce99. The packets may need to identify a more specific address at a desti- 
nation.' FoV th¥reason a'n'address^ind'icator 1 00 may be provided. As already described in relation to register access 
at the port 30. the destination identifies the port although the address 100 is used to indicate the specific register within 
the port. The Destination field is a one byte field used to route the packet to the target subsystem or module connected 
to the P link 1 5. For request packets it is the most significant byte of the address to be accessed. For a response packet 
it identifies the subsystem which issued the request. The source field is a one byte field which is used as a return 
address for a response packet. The Address field is provided by the least significant 3 bytes of the request address. 
TheTI D field is used by the requester to associate responses with requests . 

it will be appreciated that by using a bit serial port low cost access is provided to a chip, requiring only a small 
number of pins for access, and may be particularly used for debugging a CPU by use of an external host. 

in this example each CPU 12 and 1 3 is arranged to execute an instruction sequence in conventional manner. The 
instruction set will include a plurality of conventional instructions for a microcomputer but this example also includes 
an instruction to send an "event". An "event" is an exceptional occurrence normally caused by circumstances external 
to a thread of instructions Events can be used to have similar effect as an "interrupt" or "a synchronous trap" Events^ 
may be prioritised in that they can cause a d iange in the priority level at which Jhej CPU executes . An event may be 
"^e7fT5y^el^cl3t7C^ hardware in the form of thelvent logic 44 can carry out the function 

of some events without the execution of instructions in a service or handler routine. 

Events which originate from execution of an instruction by a CPU are caused by execution of the event instruction. 
This can be used to send an "event" to a CPU such as one or other of the CPU's 12 or 1 3 on the same chip or it may 
be used to send an event to a CPU on a different chip through an external connection. The CPU which executes the 
event instruction may also send an event to a further module connected to the P-link system 1 5. The event instruction 
has two 64 bit operands, the event number and the event operand. With regard to the event number 0-63, bit 15 is 
used to determine whether or not the event is a "special event". When bit 15 is set to 1, bits 0-14 are used to define 
the type of special event. Bits 16-63 of the event number are used to identify the destination address of the CPU or 
module to receive the special event. The types of special event are set out below: 



Event Name 


EN. CODE 


EN. OPERAND 


Function 


EVENTRUN 


1 


Ignored 


Resumes execution Irom suspended state of 
the receiving CPU 


EVENT RESET 


3 


Ignored 


Generate a reset event on the receiving CPU 


tVENT.SUSPEND 


5 


Ignored 


Suspends execution of the receiving CPU 


EVENT SET RE SET. HANDLER 


7 


ooot address 


RESET. HANDLER SHADOW*- RESET 
HANDLER RESET.HANDLER <- boot address 



These special events may be sent from one CPU 1 2 or 1 3 to the other or alternatively they may be sent through 
the debug port 30 from an external host to eitner of the CPU's 1 2 or 1 3 on chip. The "event" will be sent as a bit packet 

of the type previously described. 

h response to a special event, either CPU 12 or 1 3 can be made to cease fetching and issuing instructions and 

enter the suspended state 

When an EVENT.SUSPEND is received by a CPU it sets a suspend flag. This flag is OR-ed with the state of the 
suspend pin to determine the execution stage of the CPU. 
The suspended state may be entered by: 



6 



EP 0 840 221 A1 



• Asserting the SUSPEND PIN. This stops al. CPUs on the chip. 

• Sending an EVENT.SUSPEND to a CPU. Tnis suspends only the receiving CPU. 
5 The suspended state may be exited by either of: 

• Changing an external SUSPEND PIN from the asserted to negated stage. This causes all CPU{s) which do not 
have their suspend flags set to resume execution. 

w • Sending an EVENT. RUN special event to a CPU. This clears the suspend flag. If the SUSPEND PIN is negated 
this causes the receiving CPU to resume execution. 

Entering the suspended state causes a CPU to drain the execution pipelines. This takes an implementation defined 
period of time. While a CPU is suspended its execution context may be changed in any of the following ways: 

1S 

• The reset address control register RESET.H ANDLER may be changed. 

• The CPU may be reset. 

20 * Externa! memory may be changed by DMA ; e.g. using the debug link 30. 

At hard reset, (that is reset of all state on the chip) if the SUSPEND RN is asserted at the active edge of the hard 
reset the CPU(s) state will be initialized but will not boot. The CPUs will boot from the addresses contained in the 
RESET. HANDLER set prior to the reset event when they enter the running state. 

2B The EVENT. RESET causes the receiving CPU to perform a soft reset. This type of reset causes the key internal 

state to be initialized to known values while saving the old values in dedicated shadow registers such as to enable 
debugging software to determine the state of the CPU when the reset took place. 

The instruction execution system for CPU 12 or 13 and its relation with the special event logic unit 44 will be 
described with reference to Figure 9. In normal operations the CPU fetch and execute instruction cycle is as follows. 

30 A prcf etcher 101 retrieves instructions from the instruction cache 42 and the instructions are aligned and placed in a 
buffer ready for decoding by a decode unit 102. The decode unit 102 standardises the format of instructions suitable 
for execution. A despatcher circuit 103 controls and decides which instructions are able to be executed and issues the 
instructions along with any operands to the execution unit 1 04 or a load/store unit 105. The microcomputer chip of this 
embodiment has in addition the special event logic 44. This unit 44 can accept commands which originate from packets 

35 on the P-link system 15 through the interface 23 so as to override the normal instruction fetch sequence. On receipt 
of an "event suspend" packet the special event logic 44 will cause the prefetcher 101 to cease fetching instructions 
and cause the despatcher 103 to cease despatching instructions. The execution pipeline of instructions is flushed. A 
"event run 0 packet will cause the special event logic 44 to cause the prefetcher to resume fetching instructions provided 
the suspend pin is not asserted. In addition to stopping or starting normal execution instruction, the special event logic 

40 44 can cause the "instruction stream" state to be reinitialized by a soft reset which is initiated by software when the 
chip is already running and resets only some of the state on the chip. Furthermore a packet can overwrite the register 
which holds the address on which code is fetched following a reset operation. 

The special event logic 44 will now be described in greater detail with reference to Figure 1 0. 

Figure 10 shows the special event logic 44 connected through the link interface 23 to the P-lmk system 15. As is 

45 shown in more detail in Figure 10, the interface 23 is connected through a bus 110 to the special event logic 44 which 
comprises in more detail the following components An event handler circuit in which is connected by line 112 to the 
instruction fetching circuitry 101 and by line 113 to the instruction despatcher 103. The bus 110 is also connected to 
evenl logic circuitry 1 1 4 which has a bi-direclional communication along line 1 1 5 with the event handler circuit 11 1 The 
event logic circuitry 114 is connected with a bi-directional connection to counter and alarm circuitry 116 as well as a 

so suspend flag 117. A suspend pin 1 1 8 is connected to the event logic 1 1 4. A reset handler register 1 1 9 has a bi-dfrectional 
communication with the event logic 114 along line 120. It is also connected to a shadow reset handier register 121. 

The operation of the circuitry of Figure 10 is as follows. An instruction may be executed on-chip or be derived from 
operation of circuitry on an external chip ; which causes a packet to be transmitted on the P-link system 15 being a 
destination indicator identifying the module shown in Figure 10. In that case the packet is taken through the interface 

SB 23 along bus 110 to the event handler 111 and event logic 115. The event logic to determine whether the special event 
is "event run" or "event reset" or "event suspend" or "event set reset handler". 

On receipt of an "event suspend" the event logic 114 causes the suspend flag 1:7 to be set The event logic 114 
forms a logical OR of the state of the suspend flag 117 and the state of the suspend pin 118. The result is referred to 
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as the suspend state. 11 the arrival of the "event suspend" has not changed the suspend state then nothing further is 
done. It the arrival of the "event suspend" has changed the suspend state then the event logic 114 inhibits the accessing 
of instructions from the cache 42, it does this by a signal to the event handler 11 1 which controls fetching of instructions 
by the fetcher 101 and the despatch of instructions by the despatcher 103. Instructions fetched prior to receipt of the 
"event suspend" will be completed but the CPU associated with the event logic 114 will eventually enter a state where 
no instructions are being fetched or executed. 

On receipt of an "event run" the event logic 114 causes the suspend flag 117 to be cleared. The event logic 114 
performs a logical OR of the state ot the suspend flag 11 7 and the suspend pin 1 1 8. The result is known as the suspend 
state. If the arrival of the "event run" has not changed the suspend state then nothing further is done, if the arrival of 
the "event run" has changed the suspend state then tne event logic 114 ceases to inhibit access of instructions from 
the cache 42. A signal passed through the event handler 111 indicates to the fetcher 101 that the CPU should resume 
its fetch-execute cycle at the point at which it was suspended. 

In the event of receipt of an "event set reset handler" the event logic 114 causes the operand which accompanies 
the special event in the packet, to be copied into the reset handler register 119 and the previous value that was held 
in register 119 is put into the shadow reset handler register 121. 

On receipt ol an "event reset" the event logic 114 causes the event handler 111 to cease its current thread of 
execution by providing a new instruction point on line 1 1 2 to the fetcher 1 01 and thereby start executing a new instruction 
sequence whose first instruction is fetched from the address given in the reset handler register 1 99. That new address 
is obtained on line 1 20 through the event logic 1 1 4 to the event handler 1 1 1 prior to being supplied to the fetcher 101. 

II will therefore be seen that by use of the special events which may be indicated in a packet on the P-link system 
15, souices on-chip or off -chip may be used to suspend the fetching and execution of instructions by a CPU or to 
resume execution of a suspended CPU. It may also be used to reset a CPU into an initial state or to provide a new 
boot code for the CPU from anywhere on the P-link system or anywhere in an interconnected network using the external 
port 30 so that it forms part of the physical address space throughout the network which may be accessed by the CPU. 

More detailed Figures showing the special event logic 44 are provided in Figures 1 5 ; 16 and 17. Figure 1 5 shows 
the P-link system 1 5 including a Receive buffer 140 and a Transmit buffer 141 adjacent the interface 23 When a packet 
including a special event is received in the buffer 140, inputs may be provided on lines 142, 143 and 144 to special 
event decode logic 145. When bit 1 5 of the event number is set to 1 thereby indicating a special event, a P valid signal 
is provided on line 1 42 to the decode logic 1 45. At the same time the event code field of the packet is supplied on line 
143 to the decode logic 145 and the event operand field is supplied on line 144 to the decode logic 145. In response 
to assertion of the P valid signal on line 142, the decode logic 145 decodes the event code field as indicated in the 
following table: 



|| P„en.code 


Signal asserted 


Ev_handle 


| 001 


Ev_run 




011 


Ev_reset 




101 


Ev_Susp 




111 


Ev^set 


P_en.op 



On the cycle of operations following decoding., the decode logic 1 45 outputs a signal on line 146 P Event done to 
clear the buffer 140. Depending on the result ol decoding the signal on line 143, the decode logic may output either 
an Event Run signal on line 1 *7 or an Event Suspend signal on line 1 48 to suspend logic 1 49 connected to the suspend 
pin by line 150. Alternatively decoding of the signal on line 143 may cause the decode logic 145 to output an Event 
Reset signal on line 151 to the CPU pipeline circuitry 152. Alternatively the decode logic 145 may output an Event Set 
Reset Handler signal on line 153 to the reset handler logic 154 together with the operand value on bus 156. 

Figure 16 illustrates the suspend logic 149. Lines 147 and 148 form inputs to an SR latch 157 which provides a 
second input 158 to an OR gate 159 having the suspend pin providing the other input 150. In this way the signal on 
line 147 is logically or-ed with the suspend pin to generate a fetch disable signal on line 160 which includes a latch 
161 providing the suspend flag. The signal on line 160 has the effect of inhibiting the fetching of instructions from the 
instruction cache 42. This eventually starves the CPU of instructions and the CPU execution will be suspended. As- 
sertion of the signal on line 1 43 will clear any previously asserted signal on line 1 47 in the normal operation of the SR 
iatchl57. 

Figure 17 illustrates the reset handler logic 154. When the Event Set on line 153 is asserted, this is supplied to a 
reset handler state machine 1 62 connected to a register bus 163 interconnecting the reset handler register 11 9 : shadow 
reset handler register 121 and the instruction pointer bus 112. The response to assertion of signal 153 is as follows: 



8 



EP 0 840 221 A1 

-I ThestatG machine 162 asserts the read line 164 of the reset handler register 119 which causes the value in the 
reset handier register to be read onto the register bus 153. 

2 The state machine 162 asserts the write line 165 of the shadow reset handler register 121 causing the value on 
5 the register bus to be written into-the shadow reset handler register. 

3 The state machine 162 causes the value on the Evjiandle bus 156 to be put onto the register bus. 

4 The state machine 162 asserts the write line 164 of the reset handler register 119 which causes the value on 
w the register bus to be copied into the reset handler register 119. 

Alternatively if a get_iptr_sig is asserted on line 166 from the CPU pipeline 152 then the following occurs The 
state machine 1 62 asserts the read line (FVW) of the reset handler register which causes the value in the reset handler 
register to be read onto the register bus. This value is transferred along the line labelled IPTR. 

15 The following method may be used to boot one or other of the CPUs 1 2 or 1 3 of Figure 1 when the chip is connected 

to an external microcomputer through the port 30 similar to the arrangement shown in Figure 11. The two CPUs 12 
and 13 may be connected to a common suspend pin 118. When pin 116 is asserted, after the hard reset pin 46 has 
been asserted., both CPUs are prevented from attempting to fetch instructions. The external link 30 and external mi- 
crocomputer 123 can then be used to configure the minimal on-chip state by writing directly to control registers on chip 

20 1 1 and storing the necessary boot code into the DRAM memory connected to bus 33 of chip 11 . When the stale of the 
suspend pin is changed one of the CPUs can boot from the code now held in the DRAM for the chip 11 To achieve 
this., the suspend pin 11 8 is changed to an assert state after a hard reset has been asserted. The external microcomputer 
123 sends packets through the port 30 to write boot code into memory 120 shown in Figure 11. The host 123 then 
executes an instruction to send the special event EVENT SET RESET HANDLER to the selected one of microcomputers 

25 1 2 or 1 3 and in this example it will be assumed to be CPU 1 3. This will provide a new target address in the reset handler 
register 119 for CPU 13. The host 113 will then execute an instruction to send through the port 30 a special event 
EVENT SUSPEND to the other CPU 1 2. This will set the suspend flag 1 17 of CPU 12 The assert signal on the suspend 
pin 1 1 8 is then removed so that CPU 1 3 will start executing code derived from memory 1 20 from the target boot address 
held in the reset handler register 119. CPU 1 2 will remain suspended due to the start of its suspend flag 1 1 7. When it 

30 is necessary to operate CPU 12, it can be started by CPU 13 executing an instruction to send to CPU 12 the special 
instruction EVENT SET RESET HANDLER. This will change the default boot address held in the reset handler register 
1 1 9 of the CPU 1 2. CPU 1 3 must then execute an instruction to send the special event EVENT RUN to CPU 1 2 which 
will, as described above, start execution of CPU 12 with code derived from the address in the reset handler register 
119 of CPU 12. 

35 In this way the microcomputer of Figure 1 can be booted without the requirement of having valid code in a ROM. 

Although the above described boot procedure used boot code which had been loaded into the local memory 120 

for the chip 11, the similar procedure may be followed using code located in the memory 125 which is local to the 

external microcomputer 1 23. To achieve this, the same procedure, as above, is followed except that the special event 

which is sent through port 30 to load the reset handler register 119 of CPU 1 3 will provide a target address for the boot 
40 code which is located in the address space of the port 30. In this way when the assert signal is removed from the 

suspend pin 118, CPU 1 3 will start fetching code directly from the external computer and external memory. When CPU 

12 ts needed it can be started by CPU 13 as previously described 

By arranging for the host 113 to send the special instruction EVENT SUSPEND to CPU 12 prior to removing the 

assert signal from suspend pin 11 8 it is possible to reduce the amount of instruction fetching through the port 30 since 
45 CPU 13 may boot alone and then arrange for CPU 12 to boot rather than alternating to boot both CPUs 12 and 13 

from the external microcomputer through the port 30. 

Watchpoint registers may be used to monitor the execution of a program. These registers may be used to initiate 

a debug routine when a particular memory store is addressed or alternatively when instructions from a particular location 

are executed. 

so Various examples of use of the chip 1 1 in a network having a plurality of interconnected chips are shown in Figures 

11 to 14. 

In the example of Figure 11 the chip 11 is shown for simplicity with the single CPU 12 as CPU 1 3 is not involved 
in the operation described with reference to Figure 11. The chip is connected through the external memory interface 
and bus 33 to a memory chip 1 20 which is local to the CPU 12 and forms part of the local address space of the CPU 
55 12. The port 30 is connected by two serial wires 121 and 122 to a further microprocessor chip 123 which in this case 
forms a debugging host for use with chip 11. Line 121 provides a unidirectional input path to chip 11 and line 122 
provides a unidirectional output path to the host 1 23. The host 1 23 is connected tnrough a bus 1 24 to a memory chip 
1 25 which is local to the host microcomputer 1 23 and thereby forms part of the local address space of the host micro- 
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computer 1 23. In order to carry out debugging operations on the CPU 12 : the host microcomputer may operate software 
derived on-chip in the microcomputer 1 23 or from its local memory 1 25 so that the host 1 23 causes special events, as 
previously described, to be issued in packets along the serial line 1 21 through the port 30 onto the P-link system 15. 
These may have the destination address indicating the CPU 12 so that this special event is handled as already de- 
scribed with reference to Figure 10. This may be used to suspend the CPU 12 at any time and to replace the value in 
its reset handler register and to reset 1he CPU 12 either from its previous state or from a new state indicated by the 
value tn the register 119. The CPU 12 may have part of its address space located in addresses of the memory 125 
local to the host 123. The port 30 forms part of the local address space for the CPU 12 and consequently a memory 
access may be made to the address space allocated to the port 30 and in this case the response may be synthesised 
by software running on the host microcomputer 123. It is therefore possible to set the reset handler register 119 to be 
an address local to the host rather than local to the CPU 12. In this way a host can, independently of operation of the 
CPU 1 2, establish itself as the source of the instructions and/or data to be used by the CPU 1 2. This mechanism may 
be used to initiate debugging from the host 123. In the case of a chip 11 having two CPUs 12 and 13, it is possible to 
debug software running on CPU 12 as already explained while leaving software running on CPU 1 3 unaffected by the 
debug operation being carried out on CPU 12. This is the position shown in Figure 12 where the second CPU 13 is 
shown in broken lines and is operating normally in obtaining instructions from its instruction cache or from the memory 
120 quite independently of the debug routine operating on CPU 12 in conjunction with the host 123. 

Figure 1 3 shows an alternative arrangement in which the network is generally similar to that described with refer- 
ence to Figures 11 and 12. However in this case the CPU 12 is provided with a data watchpoint register 130 and a 
code watchpoint register 131 in which respective addresses for data values or instruction locations may be held so as 
to initiate a debug routine if those watchpoints are reached. In this example, the host microcomputer 123 can, at any 
point during the execution of a program by the CPU 12, briefly slop execution of the CPU 12 and cause the watchpoint 
state in the registers 130 or 131 to be modified and return control to the original program of the CPU 12. When the 
CPU 1 2 executes an instruction which triggers a watchpoint as set in either of the registers 1 30 or 1 31 , it stops fetching 
instructions in its normal sequence and starts fetching and executing instructions starting from the instruction specified 
by the content of a debug handler register 132. If the debug handler register 1 32 contains an address which is local 
to the host 123 rather than local to the CPU 12, the CPU 12 will start fetching instructions from the host 123. in this 
way the host can establish the watchpoint debugging of a program which is already running without using any of the 
memory local to the CPU 12 and without requiring tne program of the CPU 12 to be designed in a manner cooperative 
to that of the debugging host 123 In this way the examples described provides for non-cooperative debugging. The 
operating system and application software for the CPUs on the chip 11 do not need to have any knowledge of how the 
debugging host computer 123 will operate or what operating system or software is incorporated in the host 123. 

In conventional computer architectures watchpoint triggers are handled using a vector common to traps or events 
managed by the operating system. These traps and events use a conventional set of registers marked 134 which 
provide the address of the handler routine. In the example described, an extra register set 135 is provided which 
includes the debug handler register 132 and a reset handler register 136. In this manner independence from the op- 
erating system is established by providing the extra register set 135 in which the address of the handler routine for 
watchpoint handling routines may be found. 

Figure 14 shows the same network as previously described with reference to Figure 12. In this case the host 123 
is provided and connected to the port 30 so that it may operate as previously described for use in debugging and the 
transmission of speciaf events through the port 30. However in cases where it is necessary to monitor the debugging 
of one of the CPUs 12 or 1 3 as quickly as possible in debugging real time code, this example may be used to carry 
out debugging of one of the CPUs 12 or 13 by use of the other of the CPUs 12 or 13 instead of the host 123. The 
transfer of packets along the P-link 15 on-chip may be performed faster than external communications through the 
port 30. In this case either of the CPUs 1 2 or 1 3 may execute instructions which send special events to the other CPU 
on the same chip and thereby carry out a debugging operation as previously described with reference to use of the 
host 123 although in this case the control will be carried out by one of the on-chip CPUs in effecting a debugging 
operation of the other CPU on the same chip 

It will be seen that in the above example the external host 123 can be used to carry out debugging of either of the 
on-chip CPUs 12 or 13 without restrictions on the operating systems or application software of either of the on-chip 
CPUs. The watchpoint debugging may be carried out without the need to use memory local to the on<:hip CPUs. Both 
on-chip CPUs 11 and 12 and the host 123 which is externally connected have access to each other's state by packet 
communications through the port 30. The on-chip CPUs 1 2 and 1 3 can access the external memory 125 independently 
of any operation of a CPU in the host 123. This allows the on-chip CPUs to access code from a memory which is local 
to an externally connected microcomputer. 

The external host may comprise a computer or a computer device such as a programmable iogic array. 



10 



EP C 840 221 A1 



Claims 

1 . A computer system including a microprocessor on a single integrated circuit chip comprising an on-chip CPU with 
a plurality of registers a communication bus providing a parallel communication path between said CPU and at 

5 least one other module with logic circuitry connected to said bus, the on-chip circuitry being operable to receive 

on said bus digital data packets including control bits and said module being operable to receive the packet and 
respond thereto independently of operation of the CPU., said integrated circuit device further comprising an externa 
communication port connected to said bus : said port having an internal signal connection to said bus. said internal 
connection having an internal parallel format, an external signal connection having an external format less parallel 

io than said internal format, and translation circuitry to effect conversion of digital signal packets between said internal 

and external format, said input sort being operable independently of operation of said CPU. 

2. A computer system according to claim 1 in which said translation circuitry is arranged to translate bit packets 
between an on-chip bit parallel format and an external bit serial format. 

75 

3. A computer system according to claim 1 or claim 2 in which said on-chip CPU includes circuitry for generating bit 
packets including a destination identifier within each packet, saic translation circuitry being operable to translate 
packets between said internal and external formats while retaining identification of said destination. 

20 4. A computer system according to any one of the preceding claims in which said packets comprise request packets 
for sending from a packet source to a destination, said source and destination being both connected to said com- 
munication bus, and response packets for return Irom a said destination to said source. 

5. A computer system according to any one of the preceding claims in which said packets include packets with a 
25 destination identifier for use on said bus and in addition an address identifior for use within the module forming 

the destination. 

6. A computer system according to any one of the preceding claims in which said integrated circuit chip includes 
destination control circuitry connected to said bus, said control circuitry being operable to decode destination iden- 

30 trfiers within packets on said bus and control supply of the packet to a module in accordance with the decoded 

destination. 

7. A computer system according to claim 6 in which said destination control circuitry includes decode circuitry for 
receiving and decoding a destination identifier in each packet on said bus, and a dedicated communication path 

35 between said destination control circuitry and each module to control access between each module and said bus 

in dependence on operation of said decode circuitry. 

8. A computer system according to any one of the preceding claims in which said single integrated circuit chip has 
a plurality of CPUs on the same chip each connected to said communication bus wherein each CPU on said chip 

40 may address said external port. 

9. A computer system according to any one of the preceding claims in which the or each on-chip CPU has a first 
memory local to the CPU, and an external computer device is connected to said external communication port, said 
external computer device having a second memory local to the external computer device. 

45 

10. A computer system according to claim 9 in which said on-chip CPU includes circuitry for generating bit packets 
including an address identifier related to said second memory. 

11. A method of operating a computer system comprising a microprocessor on a single integrated circuit chip with an 
so on-chip CPU. a plurality of registers, a communication bus providing a parallel communication path between said 

CPU and at least one other module with logic circuitry connected to said bus, and an external communication port 
connected to said bus, said method comprising generating on said bus digital signal packets including control bits, 
receiving at said module a packet on the bus and responding thereto independently of operation of the CPU., said 
method further comprising supplying to said external port a packet having an internal parallel format, and translating 
55 the packet in the port independently of operation of the CPU to an external format less parallel than said internal 

format. 

12. A method according to claim 1 • in which the translation in the port is between an on-chip bit parallel format and 
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an external bit serial format. 

13. A method according to claim 11 or claim 12 in which bit packets are generated including a destination identifier 
and the translation in said external port translates bit packets between said internal and external tormats while 

5 retaining identification of the destination. 

1 4. A method according to any one of claims 1 1 to 1 3 in which said bit packets comprise request packets sent from a 
packet source to a destination via said bus : and response packets returned from said destination to a source of 
the request packet. 

70 

15. A method according to claim 13 in which at least some of said bit packets include in addition to said destination 
identifier an address identifier for use within a module forming the destination. 

16. A method according to any one of claims 11 to 15 in which control circuitry decodes destination identifiers within 
1$ said packets and controls supply of the packets to modules in accordance with the decoded destination. 

17. A method according to claim 1 6 in which said control circuitry provides control signals along dedicated signal paths 
to each module to control access between each module and said bus in dependence on operation of the decode 
circuitry. 

20 

18. A method according to any one of claims 11 to 17 in which a plurality of CPUs are provided on the same chip and 
each may communicate via said bus with said external port. 

19. A method according to any one of claims 11 to 18 in which the or each on-chip CPU may address memory local 
2S to said integrated circuit device or, through said external port, memory local to said external computer device. 
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Fig.3. 
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Fig. 5. 
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Fig. 9 



INSTRUCTION 
CACHE 



I 



42 



PREFETCHER 



101 



ISTREAM 
—112 



-102 



DECODE 
03^ j 



113 



44 



SPECIAL 
EVENT 
LOGIC 



DISPATCHER % 



REGISTER 
FILE 

41 



^105 



LOAD /STORE 

UNIT 
DATA CACHE 



105 



INTEGER 
EXECUTION 
UNITS 



23 



P LINK PORfK 



15 



P-LINK 



17 



EP 0 840 221 A1 




18 



EP 0 840 221 A1 



124 



Fig. 1 1 . 



c 




121 



-1 22 

I BIT SERIAL DEBUG LINK 



30 



DEBUG PORT! 

! 5 



15 



12 




RUN 

SUSPEND 
RESET 

SET.RESET.HANDLER 



EVENT LOGIC 



•119 



RESET.HANDLER 
REGISTER 



44 



DEBUGGERS CPU 




MEMORY 
LOCAL TO 
DEBUGGEE 



33 



20 



11 



Fig. 12. 12 f H0ST 



>*-N MEMORY LOCAL TO 



HOST 



125 



30- 



BIT SERIAL DEBUG LINK 



DEBUG POrT| 



A 
V 



r 



•15 



12 



r 




44 



RUN 

SUSPEND 
RESET 

SET.RESET.HANDLER 



EVENT LOGIC 



r 



119 



RESET.HANDLER 
REGISTER 




!)0 



13 



I 

If 



n 



RUN 
SUSPEND 
RESET 

SET.RESET.HANDLER 



ilk 



r 

I 

lr 
II 
II 



EVENT LOGIC 

'RESET. HANDLER 
REGISTER 



I 

-I I 



DEBUGGEE'S CPU 



UNAFFECTED CPU 



MEMORY 
LOCAL TO 
DEBUGGEE 



44 



20 



11 



119 



19 



EP 0 840 221 A1 



Fig.13. 
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